Tuning clustering in random networks with arbitrary degree distributions 
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We present a generator of random networks where both the degree-dependent clustering coeflicient 
and the degree distribution are tunable. Following the same philosophy as in the configuration model, 
the degree distribution and the clustering coefficient for each class of nodes of degree k are fixed 
ad hoc and a priori. The algorithm generates corresponding topologies by applying first a closure 
of triangles and secondly the classical closure of remaining free stubs. The procedure unveils an 
universal relation among clustering and degree-degree correlations for all networks, where the level 
of assortativity establishes an upper limit to the level of clustering. Maximum assortativity ensures 
no restriction on the decay of the clustering coefficient whereas disassortativity sets a stronger 
constraint on its behavior. Correlation measures in real networks are seen to observe this structural 
bound. 
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I. INTRODUCTION 

Models in complex networks science aim to repro- 
duce some common empirical statistical features ob- 
served across many different real systems, from the In- 
ternet to society [ll Q, Q ■ Many of those models are able 
to recreate prominent recurrent attributes, such as the 
small-world property and scale-free degree distributions 
with characteristic exponents between 2 and 3 as mea- 
sured for networks in the real world. Other characteris- 
tics, such as the presence, the shape, and the intensity 
of correlations, are also unavoidable in models intend- 
ing to help us to understand how these complex systems 
self-organize and evolve. 

The first reference to correlations in networks appear- 
ing in the literature is the clustering coefficient , which 
refers correlations among three vertices. The clustering is 
a measure of transitivity which quantifies the likelihood 
that two neighbors of a vertex are neighbors themselves. 
Then, it is a measure of the number of triangles present 
in a graph. In addition to the empirical evidence that the 
vast majority of real networks display a high density of 
triangles, the concept of clustering is also relevant due to 
the fact that triangles are -together with edges- the most 
common building blocks taking part in more complex but 
elementary recurring subgraphs, the so-called motifs ^^J. 
It has been argued that networks large-scale topological 
organization is closely related to their local motifs struc- 
ture 0] so that these subgraphs could be related to the 
functionality of the network and can be fundamental in 
determining its community structure 0, 1^ • 

All these mean that a correct quantification and mod- 
eling of the clustering properties of networks is a matter 
of great importance. However, most modeling efforts be- 
yond the degree distribution have focused in the repro- 
duction of two point correlations patterns, typified by the 
average nearest neighbors degree , so that clustering is 



just obtained as a byproduct. In most synthetic net- 
works, it vanishes in the thermodynamic limit, but, as to 
many other respects, scale-free networks with divergent 
second moment stand as a special case. The decay of 
their clustering with the increase of the network size is 
so slow that relatively large networks with an appreciable 
high cohesiveness can be obtained Nevertheless, it 
remains to be an indirect effect and no control over its 
intensity or shape is practicable. Therefore, an indepen- 
dent modeling of clustering is required and a few grow- 
ing linear preferential attachment mechanism have been 
suggested. One of the proposed models reproduces 
a large clustering coefficient by adding nodes which con- 
nect to the two extremities of a randomly chosen network 
edge, thus forming a triangle. The resulting network has 
the power-law degree distribution of the Barabasi- Albert 
model P{k) ~ , with {k) = 4, and since each new 
vertex induces the creation of at least one triangle, the 
model generate networks with finite clustering coefficient. 
A generalization on this model [T^ which allows to tune 
the average degree to (fc) = 2m, with m an even inte- 
ger, considers new nodes connected to the ends of to/2 
randomly selected edges. Two vertices and three vertices 
correlations can be calculated analytically through a rate 
equation formalism. The clustering spectrum is here fi- 
nite in the infinite size limit and scales as k^^. 

Those models do not allow much freedom in the form of 
the resulting clustering coefficient, neither in the ensuing 
degree distribution, so that, although a valuable first ap- 
proach, they constitute a timid attempt as clustering gen- 
erators. In this paper, we make headway by introducing 
a generator of random networks where both the degree- 
dependent clustering coefficient and the degree distribu- 
tion are tunable. After a brief review of several cluster- 
ing measures in section II, the algorithm is presented in 
section III. In section IV, we check the validity of the 
algorithm using numerical simulations. Section V is de- 
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voted to the theoretical explanation of the constraints 
that degree-degree correlations impose in the clustering. 
We find that assortativity allows higher levels of clus- 
tering, whereas disassortativity imposes tighter bounds. 
As a particular case, we analyze this effect for the class 
of scale-free networks. We end the section by examin- 
ing some empirical networks, finding a good agreement 
with our calculations. Finally, conclusions are drawn in 
section VI. 



II. MEASURES OF CLUSTERING 

Several alternative definitions have been proposed over 
time to quantify clustering in networks. The simplest 
measure is defined as 0, Q 



computed on the basis of the degree distribution P{k) as 



3 X Number of triangles 
Number of connected triples 



(1) 



This scalar quantity does not give much information 
about local properties of different vertices because it just 
counts the overall number of triangles regardless of how 
these triangles are placed among the different vertices of 
the network. 

The clustering coefficient, first introduced by Watts 
and Strogatz provides instead local information and 
is calculated as 



2T,; 



(2) 



where Ti is the number of triangles passing through ver- 
tex i and h is its degree. The average of the local cluster- 
ing coefficients over the set of vertices of the network, C, 
is usually known in the literature as the clustering coeffi- 
cient. Watts and Strogatz were also the first in pointing 
out that real networks display a level of clustering typ- 
ically much larger than in a classical random network 
of comparable size, Crand — {k) /N, with (fc) the aver- 
age degree and N the number of nodes in the network. 
Although C and Ca are sometimes taken as equivalent, 
they may be very different, even though both measures 
are defined in the interval [0, 1]. 

With the definition of Ci we have gone to the other 
extreme of the spectrum -from a global to a purely local 
perspective- so that we have highly detailed information. 
One can adopt a compromise between the global property 
defined by C, or Ca, and the full local information given 
by a by defining an av erag e of a over the set of vertices 
of a given degree class that is. 



c(fc) 
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iGT(fc) 
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fc(fc - l)Nk 



E 2^., (3) 
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where Nk is the number of vertices of degree k and T(fc) 
is the set of such vertices. The corresponding scalar mea- 
sure is called the mean clustering coefficient and can be 



= j2pimk), 



(4) 



which must not be confused with the clustering coefficient 
C = c/(l - P(0) - In fact, we have implicitly 

assumed that c{k = 0) = c{k = 1) = whereas in the 
definition of C we only consider an average over the set 
of vertices with degree k > 1. this fact explains the 
difference between both measures. 

In the case of uncorrelated networks, c{k) is indepen- 
dent of k. Furthermore, all the measures collapse and 
reduce to C [Hill [H. 



c(fc) = Ca = C = 



1 i<k^ > ~ <k>f 



N 



fc > I. (5) 



Therefore, a functional dependence of c{k) on the degree 
can be attributed to the presence of correlations. Indeed, 
it has been observed that c(fc) exhibits a power-law be- 
havior c{k) ~ k~" (typically < a < I) for several real 
scale-free networks. Hence, the degree dependent cluster- 
ing coefficient has been proposed as a measure of hierar- 
chical organization and modularity in complex networks 

Recently, a new local clustering coefficient has been 
proposed, which filters out the bias that degree-degree 
correlations can induce on that measure 12(11 



(6) 



where LUi is the maximum number of edges that can be 
drawn among the ki neighbors of vertex i. This new mea- 
sure does not strongly depend on the vertex degree, re- 
maining constant or decreasing logarithmically with the 
increase of k when computed for several real networks. 



III. THE ALGORITHM 

In this paper, we develop and test a new algorithm 
that, in the same philosophy of the classical configuration 
model (CM), generates networks with a given degree dis- 
tribution and a preassigned degree dependent clustering 
coefficient c(fc), as defined in Eq. 

The CM has been one of the most successful algorithms 
proposed for network formation |^ . The relevance 
of the algorithm relies on its ability to generate random 
networks with a preassigned degree sequence -taken from 
a given degree distribution- at the user's discretion while 
maximizing the network's randomness at all other re- 
spects. The algorithm became relevant as soon as more 
real networks were analyzed and proved to strongly de- 
viate from the supposed Poisson degree distribution pre- 
dicted by the classical model of Erdos and Renyi |23l[24l |. 
Ever since, the CM has been extensively used as a null 
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model in contraposition to real networks with the same 
degree distribution. 

One of the well-known properties of the CM is that 
clustering vanishes in the limit of very large networks (see 
Eq. (jsj) and, thus, it clearly deviates from real networks, 
for which clustering is always present. In general, a high 
level of clustering may change the percolation properties 
of the network, alter its resilience in front of removal of 
its constituents, or affect the dynamics that take place on 
top of them. Since such processes inextricably entangle 
topology and functionality, it would be very interesting to 
have at one's disposal an algorithm that generates clus- 
tered networks in a controlled way so that one can check 
which is the real effect of transitivity on its topological 
and dynamical properties. 

With this purpose, we introduce an undirected un- 
weighted static model where the total number of nodes 
in the network, N, remains constant, as in the case of 
the CM. The algorithm comprises three different parts: 
1) Assignment of a degree to each node and assignment 
of a number of triangles to each degree class according to 
the expected distributions. 2) Closure of triangles and 
3) closure of the remaining free stubs. In what follows, 
we give a detailed description of the algorithm. 

1) Degree and clustering from expected distribu- 
tions 

• An a priori degree sequence is chosen according to 
a given distribution P{k), so that each vertex is 
awarded an a priori number of connections in the 
form of a certain number of stubs. 

• An a priori clustering coefficient c{k) is also fixed, 
so that each class of nodes of degree k is assigned 
an a priori number of triangles '3q| . Note that the 
number of triangles is fixed for the whole class and 
not for the particular vertices of the class. This is a 
key point of the algorithm because fixing the num- 
ber of triangles to each single vertex would impose 
a number of constraints that would make nearly 
impossible to close the network. 

• All nodes begin with a number of associated edges 
and all degree classes begin with a number of 
associated triangles. 

2) Triangle formation 

First some preliminary remarks. Both stubs and edges 
can be selected to form a triangle. Stubs are half links 
associated to one node, and edges are entire links associ- 
ated to two nodes, and thus have double probability with 
respect to stubs to be selected to participate in a trian- 
gle. Let us define the set of eligible components (EC) 
as the set of free stubs and edges associated to nodes 
belonging to degree classes with a number of triangles 
below its expected value (unsatisfied classes). Stubs and 
edges of nodes in satisfied classes should not be in the set 
of EC. Edges of nodes which cannot form more triangles 



(with only edges as components and neighbors without 
stubs) should not be in the set of EC. Stubs and edges of 
nodes with only one component should not be in the set 
of EC. Notice that the set of EC changes dynamically as 
triangles are formed. 

The algorithm then proceeds by choosing three differ- 
ent nodes and forming a triangle among them whenever 
it is possible and it did not exist previously. The selection 
of the nodes is performed hierarchically as follows. 

• For the first node, a degree class ki among the 
ones with unsatisfied number of expected triangles 
is chosen with a certain probability distribution, 
n(fc), not necessarily uniform (the specific form for 
this function and the motivation to introduce it will 
be discussed at the end of the section and a theoret- 
ical explanation is given in section 0. Then, the 
node is selected through a component which is cho- 
sen with uniform probability within the subset of 
eligible components in the class, EC(fci). A second 
different component of the same node is selected. 

• If the two chosen components are edges, and the 
second and the third nodes at the end of the edges 
still have free stubs, the triangle is formed by merg- 
ing one free stub of the second node and one free 
stub of the third node (see Fig. 




FIG. 1: The two selected components of the first node, 
marked 1, are edges. The triangle if formed by connecting 
nodes 2 and 3, whenever they have free stubs. 

• If one component is a stub and the other an edge, a 
third node is necessary. First, a new component is 
selected for the second node at the end of the edge. 
If it is an edge, the triangle is formed by merging 
one free stub of the first node and one free stub of 
the third node. If it is a stub, then a third node is 
chosen in the same way as the first one under the 
condition that it has two free stubs. The triangle is 
then formed by merging these two free stubs with 
the ones of the first and second nodes (see Fig. [2J|. 

• It may happen that the two components of the first 
node are stubs. Then, a new node is selected in the 
same way as the first one under the condition of 
having at least one free stub, and a second compo- 
nent is also chosen for this second node. If both 
components of the second node are stubs, a third 
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FIG. 2: The two selected components of the first node are 
one edge and one stub. A second component is chosen for the 
second node. On the left side of the figure, the component is 
an edge whereas on the right side it is a free stub. 




node with two free stubs is selected and the trian- 
gle is closed. If one component of the second node 
is a stub and another an edge, the node at the end 
of the edge will be the third node and the triangle 
is formed linking stubs between the first and the 
second node and the first and the third node (see 
Fig. 01). 
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FIG. 3: The two selected components of the first node, 
marked 1, are stubs. 



• After each triangle is formed, all dynamic quanti- 
ties are updated: linked stubs are converted into 
edges and the corresponding number of new trian- 
gles is added to all involved degree classes. It is 
worth to mention that not only one more triangle 
is computed for the classes of the nodes forming 
the triangle, but the degree classes of simultaneous 
neighbors of pairs of those nodes may also be af- 
fected if those pairs were not previously connected. 
The set EC is also updated, removing components 
of nodes in new satisfied classes as well as nodes 
with only one component or nodes which cannot 
form more triangles. 

• This process is repeated until all classes are satisfied 
or there are no more components in the eligible set. 

3) Closure of the network 

The final step consists in the closure of the network 
by applying the classical configuration model to the 
remainder stubs. Pairs of these stubs are selected 
uniformly at random and the corresponding vertices are 
connected by an undirected edge. 



FIG. 4: Clustering coefficient for networks generated by the 
algorithm using a Poisson degree distribution with average 
degree (k) — 4 and expected clustering coefficient c(fc) = 
co{k - (solid lines) with a = 1, 0.7, 0.4, and co = 0.5 

in all cases. Each curve is an average over three different 
realizations with a network size of A'^ = 10^. The parameter 
l3 is equal to I for a = 1 and a — 0.7, and (3 = 0.5 for a = 0.4. 



In this way, the algorithm is able to reproduce networks 
with a given degree distribution, P{k), and a given clus- 
tering coefficient, c(fc), as long as the assortativity -that 
is, positive degree-degree correlations- is high enough to 
avoid constraining c(fc). This is by no means a deficiency 
of the algorithm but an universal structural constraint 
imposed by the degree-degree correlation pattern of the 
network. In general, with the maximum assortativity one 
can reproduce any desired level of clustering, whereas dis- 
assortative networks have instead a bounded clustering 
coefficient. A theoretical explanation is given in section 
V. 

In our algorithm, the level of assortativity is controlled 
by the probability by which the degree class is chosen pre- 
viously to the selection of the node. This can be done in 
a number of different ways. In our case, we tune the as- 
sortativity by choosing a proper form for the probability 
n(fc). For instance, an uniparametric function modeling 
different assortativity levels is given by n(/c) oc Tr{k)^ , 
where Tr{k) is the number of triangles remaining to be 
formed in the degree class fc in a given iteration. The 
value of f3 typically ranges in the interval [0, 1], generat- 
ing more assortative networks as P approaches 0. 



IV. NUMERICAL SIMULATIONS 

To check the feasibility and reliability of the algorithm, 
we have performed extensive numerical simulations, gen- 
erating networks with different types of degree distribu- 
tions and different levels of clustering. The chosen forms 
for the degree distribuion are Poisson, exponential, and 
scale-free. The degree dependent clustering coefficient is 
chosen to be c(fc) = co{k — The numerical pref- 
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FIG. 5: The same as in Fig|3]for an exponential degree dis- 
tribution. In this case, the parameter /3 is 1 for a = 1 and 
a = 0.7, and /3 = for a = 0.4. 
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FIG. 6: The same as in Fig. ^ior an scale-free degree distri- 
bution of exponent 7 = 3. In this case, the parameter /3 is 1 
for a = 1 and a = 0.7, and /3 = 0.2 for a = 0.4. 



actor is set to cq = 0.5 and the exponent takes values 
a — 1, 0.7, and 0.4. The size of the generated networks 
is = 10^ and each curve is an average over three dif- 
ferent realizations. 

Simulation results are shown in Figs. ^[S] andl^l which 
correspond to Poisson and exponential degree distribu- 
tion with average degree (k) — 4, and scale-free degree 
distributions with exponent 7 = 3, respectively. As it 
can be seen, the degree dependent clustering coefficient 
is well reproduced in all cases just by decreasing the value 
of P if necessary (the values of P used in each simulation 
are specified in the caption of the corresponding figure) . 
The standard procedure we follow is to start with /3 = 1 
and to check whether the tail of c{k) is well reproduced. 
If not, we decrease its value until the entire curve fits the 
expected shape. 

Fig. [7| shows the degree distributions generated by the 
algorithm for the simulations of the previous figures, con- 
firming that, indeed, the generated degree distributions 



FIG. 7: Degree distributions generated by the algorithm 
(symbols) as compared to the expected ones (solid lines). 



match the expected ones. 



V. CLUSTERING VS. DEGREE-DEGREE 
CORRELATIONS 

As we advanced in the previous section, degree-degree 
correlations constraint the maximum level of clustering a 
network can reach. A naive explanation for this is that, 
if the neighbors of a given node have all of them a small 
degree, the number of connected neighbors (and hence, 
the clustering of such node) will be bounded. This is the 
main idea behind the new measure of clustering intro- 
duced in [2OJ. However, we can make a step forward and 
quantify analytically this effect. To do so, we need to 
define new quantities which take into account the prop- 
erties of vertices that belong to the same triangle. Let 
us define the multiplicity of an edge, rriij, as the num- 
ber of triangles in which the edge connecting vertices i 
and j participates. This quantity is the analog to the 
number of triangles attached to a vertex, Ti. These two 
quantities are related through the trivial identity 



E 



2T,,. 



(7) 



which is valid for any network configuration. The matrix 
aij is the adjacency matrix, giving the value 1 if there is 
an edge between vertices i and j and otherwise. 

It is possible to find a relation between multiplicity, 
degree distributions and clustering. Summing the above 
equation for all vertices of a given degree class we get 



fe' iGT(fc) ieT(fe') ieT(fc) 

Now, there are some key relations which can be used 



*eT(fe) jeT(fe') 



kk' 



(9) 
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where rukk' is the average multiphcity of the edges con- 
necting the classes k and fc', and E^k' is the number 
of edges between those degree classes. Finally, taking 
into account Eq. |(2Jl and the fact that the joint degree 
distribution satisfies P{k,k') — limjv^oo -Bfefe'/(fc)iV, we 
obtain the following closure condition for the network 



Y,mkk'Pik,k') = kik-l)c{k) 



m 
(k) ■ 



(10) 



Let us emphasize that this equation is, in fact, an identity 
fulfilled by any network and, thus, it is, for instance, at 
the same level as the degree detailed balance condition 
derived in 25]. These identities are important because, 
given their universal nature, they can be used to derive 
properties of networks regardless their specific details. As 
an example, in we used the detailed balance condition 
to prove the divergence of the maximum eigenvalue of the 
connectivity matrix that rules the epidemic spreading in 
scale-free networks, which, in turn, implies the absence 
of epidemic threshold in this type of networks. 

The multiplicity matrix is, per se, a very interesting 
object that gives a more detailed description on how tri- 
angles are shared by vertices of different degrees. In prin- 
ciple, rrikk' does not factorize and, therefore, non trivial 
correlations can be found. The global average multiplic- 
ity of the network, m, can be computed as 



y^^y^^ mkk'Pjk, k') = 



{k{k - l)c(fc)} 

W) ■ 



(11) 



Values of fh close to zero mean that there are no tri- 
angles. When m « 1, triangles are mostly disjoint and 
their number can be approximated as T(k) w fc/2, and, 
when ?fi ^ 1, triangles jam into the edges, that is, many 
triangles share common edges. 

We are now equipped with the necessary tools to ana- 
lyze the interplay between degree-degree correlations and 
clustering. The key point is to realize that the multiplic- 
ity matrix satisfies the inequality 



rrikh 



< min(fc, k') 



1, 



(12) 



which comes from the fact that the degrees of the nodes 
at the ends of an edge determine the maximum number of 
triangles this edge can hold. Multiplying this inequality 
by P{k, k') and summing over k' we get 

k{k - mk)^ < E i^in(fc, k')Pik, k') 

\'^) k' ^ ' 

(13) 

where we have used the identity Eq. (|10|l . This inequal- 
ity, in turn, can be rewritten as 



1 

%k) < 1 - Y.^'^- k')P{k'\k) EE A(fc). 



(14) 



fe'=i 



Notice that A(fc) is always in the interval [0, 1] and, there- 
fore, c(fc) is always bounded by a function smaller (or 



equal) than 1. 
reads 



In the limit of very large values of k, Eq. 



c(fc) < A(fc) 



Kin{k) 1 



(15) 



where fc^„(A:) is the average nearest neighbors degree of 
a vertex with degree k. The superscript r (of reduced) 
refers to the fact that it is evaluated only up to k and, 
therefore, fcj^„(fc) < k. For strongly assortative networks 
kmiik) ^ k, so that A(fc) ~ C(l) and there is no restric- 
tion in the decay of c{k). In the opposite case of disas- 
sortative networks, the sum term in the right hand side 
of Eq. H14|) may be fairly large and then the clustering 
coefficient will have to decay accordingly. 

In FiglHl we show this effect by changing the level of 
clustering while keeping the degree-degree correlations 
unchanged by fixing the value of /3 to /3 = 1. As it can 
be seen, lower levels of clustering are better reproduced. 
However, the clustering collapses to a limiting curve when 
the expected value crosses it. That is, any function c(fc) 
is possible whenever it is defined below a limiting curve 
which is a function of the degree correlation pattern of 
the network. 
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FIG. 8; Clustering coefficient for Poisson like degree distri- 
butions, using P — 1. Different curves correspond to differ- 
ent values of the prefactor cq. Doted lines are the expected 
clusterings whereas symbols are the ones generated by the al- 
gorithm. The solid line is a guide for the eye of the limiting 
curve. For lower values of the prefactor, the expected value 
can be fitted in a wider region. Notice that all curves collapse 
into the same limiting curve, which indicates the intrinsic con- 
straint Eg.ltT^. 

Another way to see the same effect is shown in Fig. 
1^1 In this case we keep the expected clustering while 
changing the assortativity of the network by tuning the 
parameter f3. As it can be seen, as correlations become 
more and more assortative (decreasing values of (3) the 
expected clustering can be further reproduced. 

We would like to point out that the function A(fc) is just 
an upper bound for the clustering coefficient. The actual 
bound will probably be even smaller due to the fact that 
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FIG. 9: Average nearest neighbors degree (top), knn{k), and 
clustering coeSicient (bottom), c{k), for a power law degree 
distribution with exponent 7 = 3 using two different levels of 
assortativity, P — 1 and /3 — 0.2. As we increase assortativity 
the expected clustering can be fitted in a wider region. The 
solid line is the expected clustering c{k) — 0.5(fc — 1)~° *. 



we have only considered the restriction over one edge 
and the degrees of the corresponding vertices. A more 
accurate estimation would involve more than one edge 
and the corresponding vertices attached to them . 



A. Scale-free networks 

Scale-free networks belong to a special class of net- 
works which deserve a separate discussion. Indeed, it 
has been shown that, when the exponent of the degree 
distribution lies in the interval 7 G (2, 3] and its do- 
main extends beyond values that scale as N-^^^, disas- 
sortative correlations are unavoidable for high degrees 
[H m 1^, "291. Almost all real scale-free networks ful- 
fill these conditions and, hence, it is important to ana- 
lyze how these negative correlations constraint the be- 
havior of the clustering coefficient. Let us assume a 
power law decay of the average nearest neighbors degree 
of the form knn{k) ^ nk~^ . One can prove that this 
function diverges in the limit of very large networks as 
knn{k) ~ (fc^) ^ fcp ~^, where fee is the maximum degree 
of the network |26| . Then, the prefactor k must scale in 
the same way which, in turn, implies that the reduced 




FIG. 10: Clustering c{k) versus the maximum value A(A:) for 
several real networks. In all cases, empirical measures fall 
below the diagonal line, validating the inequality Eq. (|14^ . 



average nearest neighbors degree behaves as 

Knik)-k'-'-'. (16) 

Then, from Eq. H15() the exponent of the degree depen- 
dent clustering coefhcient, a, must verify the following 
inequality 



a>j + S -2. 



(17) 



Just as an example, in the case of the Internet at the 
Autonomous System level j^li the reported values for 
these three exponents {a = 0.75, 7 — 2.1, and S = 0.5) 
satisfy this inequality close to the limit {a — 0.75 > 
7-^(5-2 = 0.6). 



B. Real networks 

The interplay between degree correlations and cluster- 
ing can also be observed in real networks. We have mea- 
sured the functions A(fc) and c{k) for several empirical 
data sets, finding that the inequality Eq. H14|l is always 
satisfied. The analyzed networks are the Internet at the 
Autonomous System level (AS) [s^, the protein inter- 
action network of the yeast S. Cerevisiae (PIN) [sj, an 
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FIG. 11: Empirical measures of the ratio between the cluster- 
ing coefficient c(fc) and the maximum value \(k) for different 
real networks. 

intra-university e-mail network 32], the web of trust of 
PGP 01, the network of coauthorships among academics 
[S^ . and the world trade web (WTW) of trade relation- 
ships among countries [s^- 

In Fig. ^1 we plot the clustering coelRcient c(fc) as 
a function of A(fc). Each dot in these figures corre- 
spond to a different degree class. As clearly seen, in all 
cases the empirical measures lie below the diagonal line, 
which indicates that the inequality Eq. H14|l is always pre- 
served. In Fig. ^] we show the ratio c(fc)/A(fc). The rate 
of variation of this fraction is small and, thus, the de- 
gree dependent clustering coefficient can be computed as 
c{k) = X{k)f{k), where f{k) is a slowly varying function 
of k that, in many cases, can be fitted by a logarithmic 
function. 



VI. CONCLUSIONS 

We have introduced and tested a new algorithm that 
generates ad hoc clustered networks with given degree 
distribution and degree dependent clustering coefficient. 
This algorithm will be useful for analyzing, in a controlled 
way, the role that clustering has on many dynamical pro- 
cesses that take place on top of networks. We have also 
introduced a new formalism which backs our algorithm 
and allows to quantify clustering in a more rigorous man- 
ner. In particular, an universal closure condition for net- 
works is found to relate the degree dependent clustering 
coefficient, degree-degree correlations and the number of 
triangles passing through edges connecting vertices of dif- 
ferent degree classes. Using this relation, we have found 
how the correlation pattern of the network constraints 
the function c{k). In particular, assortative networks are 
allowed to have high levels of clustering whereas disas- 
sortative ones are more limited. Overall, we hope that 
a more accurate shaping of synthetic networks will im- 
prove our understanding of real ones. At this respect, we 
believe our algorithm will be useful for the community 
working on complex networks science. 
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